TITLE OF THE INVENTION 

Semiconductor Memory Device Storing Part of Program Designated 
by Programmer, and Software Development Apparatus for System Using 
the Same 

5 BACKGROUND OF THE INVENTION 
Field of the Invention 

The present invention relates to a semiconductor memory device 
used together with a CPU (Central Processing Unit), and more particularly 
to a semiconductor memory device storing part of a program designated by 
10 a programmer and outputting an instruction code when the CPU fetches 
the instruction, and a software development apparatus for a system using 
the same. 

Description of the Background Art 

Recently, as the processing speed of CPU has been increasingly 

15 accelerated, a CPU mounting a cache memory thereon has been actively 

developed. Generally, a cache memory is a small-sized, high-rate memory 
device which is connected between the CPU and a main memory and which 
temporarily holds the content of the main memory recently used by the 
CPU. Caches are classified into three tji^es: "instruction cache" when a 

20 content to be held is an instruction from the CPU; "data cache" when the 
content is data; and "unified cache" when the content is both the 
instruction and the data. 

The cache has a mechanism which utilizes a feature that the CPU 
locally accesses a memory, and is well known that the cache generally has 

25 an effect of improving the processing performance of the CPU, The efifect 
is, however, dependent on a program performed by the CPU. For example, 
if a program performed by the CPU has a characteristics that there is no 
ordinality in memory access, cache hit rate lowers and the cache cannot 
improve the processing performance of the CPU. 

30 In addition, since the cache inherently has a mechanism of detecting 

the consistency of a reference address outputted from the CPU with the 
content of the cache, the power consumption of the cache tends to increase. 
Due to this, even if a cache is introduced to the CPU, CPU power efficiency 
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(processing performance/power consumption) does not always improve. 

Furthermore, the content of the cache depends on the program 
performing history of the CPU. For this reason, even if the CPU accesses 
the same address, the CPU sometimes hits the cache and sometimes fails to 
5 hit the cache. Access cycles for instructions and data cannot be 

guaranteed. As a result, it is difficult for a system which uses a cache to 
optimize software having high real-time characteristics. 

In order to solve these problems, the following techniques are 
disclosed. 

10 1. Japanese Patent Laying-Open No. 10-340226 

2. Japanese Patent Laying-Open No. 9-319657 

3. USP 5,381,533 

4. P. R. Panda et al., *TEfificient Utilization of Scratch-Pad Memory 
in Embedded Processor AppUcations", European Design and Test 

15 Conference, March 1997 

According to Prior Art 1, a tag memory is divided to a first tag 
memory which includes a common bit group through respective ways in an 
address tag, and a second tag memory which includes individual bit groups 
for respective ways. An address indicated from a data processor is divided 

20 and the divided addresses are compared with each other for each of the first 
and second tag memories, thereby decreasing the power consumption of a 
microprocessor without lowering hit rate. Although Prior Art 1 
contributes to improving power efficiency, it does not contribute to 
improving the performance of the CPU when the CPU performs a program. 

25 According to Prior Art 2, an instruction cache is constituted of one 

line and one instruction, and an instruction stream buffer is provided 
between the instruction cache and a main memory. The instruction 
stream buffer is integer-mxdtiple times as large as the cache line, and 
consecutive variable-length instructions read firom the main memory can be 

30 written to the instruction stream buffer. Output to the instruction cache 
can be performed on each cache line unit. Although Prior Art 2 can 
eliminate useless instruction reading, it cannot improve power efficiency 
and cannot guarantee an instruction access cycle. 
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According to Prior Art 3, each of instruction trace segments includes 
instruction blocks, the first instruction in each block becomes the next 
instruction to a branch instruction, and the blocks are arranged so that the 
last instruction of each block becomes a branch instruction. It is thereby 
5 possible to improve hit rate and power efficiency. However, similarly to 
Prior Art 2, an instruction access cycle and a data access cycle cannot be 
guaranteed. 

According to Prior Art 4, a small-sized, high-rate scratch-pad 
memory (hereinafter, abbreviated as "SPM') is arranged in an address 

10 space different fi:om a main memory. A programmer designates an 
instruction or data to be frequently accessed, and the designated 
instruction or data is held in SPM. As a result, the number of access 
cycles to SPM is guaranteed and a consistency detection function which is 
normally included in a cache becomes unnecessary, thus making it possible 

15 to decrease power consumption. Generally, however, the SPM is used as a 
data memory, i.e., used in place of a data cache, and difficvdt to mount as 
an instruction memory. 
SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a semiconductor 

20 memory device capable of guarantee of an access cycle during instruction 
fetch and capable of improving the power efficiency of a processor system, 
and a software development apparatus capable of efficiently developing 
software for a processor system which includes the semiconductor memory 
device. 

25 According to one aspect of the present invention, a semiconductor 

memory device connected between a processor and an instruction cache, 
includes: an instruction bxiffer storing part of consecutive instructions 
performed by the processor; an address table setting therein an address 
range of the consecutive instructions stored in the instruction buffer; a 

30 determination unit determining whether an instruction address outputted 
from the processor fall within the address range set in the address table; 
and a selector selectively outputting an instruction code stored in the 
instruction bxiffer and an instruction code stored in the instruction cache in 
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accordance with a determination resxilt of the determination unit. 

The selector selectively outputs the instruction code stored in the 
instruction bxiffer and that stored in the instruction cache in accordance 
with the determination result of the determination unit. Therefore, in the 
5 case where the processor fetches an instruction stored in the instruction 

buffer, an access cycle is guaranteed to thereby make it possible to facilitate 
. the development of software having high real-time characteristic. In 
addition, since the operation of the instruction cache is stopped during the 
instruction fetch, it is possible to improve power efficiency. 

10 According to another aspect of the present invention, a software 

development apparatus for a processor system including a semiconductor 
memory device connected between a processor and an instruction cache and 
storing part of consecutive instructions performed by the processor, 
includes: an extraction section extracting the consecutive instiructions 

15 arranged in the semiconductor memory device from a source file; an 

acquisition section acquiring address information on addresses at which the 
consecutive instructions extracted by the extraction section are originally 
mapped; a reformatting section reformatting the address information 
acquired by the acquisition section into a form compatible with the source 

20 file; and a generation section generating a load modvile from the source file, 
the consecutive instructions extracted by the extraction section, and the 
address information reformatted by the reformatting section. 

The extraction section extracts the consecutive instructions arranged 
in the semiconductor memory device from the source file. The generation 

25 section generates the load module from the source file, the consecutive 
instructions extracted by the extraction section, and the address 
information reformatted by the reformatting section. Therefore, a 
programmer can easily develop software for a processor system including 
tiie semiconductor memory device. 

30 The foregoing and other objects, features, aspects and advantages of 

the present invention will become more apparent from the following 
detailed description of the present invention when taken in conjunction 
with the accompanjdng drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram showing a configuration example of a 
processor system which employs a semiconductor memory device according 
to a fixst embodiment of the present invention; 
5 Fig. 2 is a block diagram showing an internal configuration of the 

semiconductor memory device according to the first embodiment of the 
present invention; 

Fig. 3 is a memory map in the case where a CPU core accesses the 
semiconductor memory device as a memory mapped device; 
10 Fig. 4 is a block diagram showing a configuration of a software 

development apparatus for a processor system which employs the 
semiconductor memory device according to the first embodiment of the 
present invention; 

Fig. 5 is a block diagram showing a functional configuration of the 
15 software development apparatus according to the first embodiment of the 
present invention; 

Fig. 6 is a flow chart for describing the processing procedures of the 
software development apparatus according to the first embodiment of the 
present invention; 

20 Fig. 7A shows part of a program performed by the CPU core, Fig. 7B 

shows one example of a program extracted by an instruction extraction 
section, Fig. 7C shows address information reformatted by an address 
information reformatting section, and Fig. 7D shows part of the 
disassemble result of a module linked by a linker; 

25 Figs. 8A to 8C show an operation of the software development 

apparatus according to the first embodiment of the present invention as the 
memory images of the program to be performed; 

Fig. 9 is a block diagram showing a functional configuration of a 
software development apparatus according to a second embodiment of the 

30 present invention; 

Fig. 10 is a flow chart for describing the processing procedures of the 
software development apparatus according to the second embodiment of the 
present invention; 
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Fig. 11 shows part of a program arranged in the semiconductor 
memory device in software development using C language; 

Fig. 12 is a schematic block diagram of a semiconductor memory 
device according to a third embodiment of the present invention; 
5 Fig. 13 is a memory map in the case where a CPU core accesses the 

semiconductor memory device as a memory mapped device; 

Fig. 14 is a flow chart for describing the processing procedures of a 
software development apparatus according to the third embodiment of the 
present invention; 
10 Fig. 15 is a block diagram showing a configuration of a 

determination \mit according to a fourth embodiment of the present 
invention; 

Fig. 16 is a block diagram showing a configuration of a 
determination unit according to a fifth embodiment of the present 
15 invention; 

Fig. 17 is a flow chart for describing the processing procedures of a 
software development apparatus according to the fifth embodiment; and 

Fig. 18 shows one example of a memory map according to a sixth 
embodiment of the present invention. 
20 DESCRIPTION OF THE PREFERRED EMBODIMENTS 
First Embodiment 

Fig. 1 is a block diagram showing a configuration example of a 
processor system which employs a semiconductor memory device according 
to a first embodiment of the present invention. This processor system 
25 includes a semiconductor memory device (expressed as "STASH" in Fig. 1) 1, 
a CPU core 2, an instruction cache 3, a data cache 4, a main memory 5, and 
a USART (Universal Synchronous and Asjnachronous Receiver- 
Transmitter) 6. 

Semiconductor memory device 1 is connected between CPU core 2 
30 and instruction cache 3, and is also connected to a system bus 7. CPU core 
2 can directly fetch an instruction from semiconductor memory device 1, 
and can also fetch an instruction from instruction cache 3 through 
semiconductor memory device 1. CPU core 2 can read and write data fi:om 
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and to data cache 4. CPU core 2 can access semiconductor memory device 
1 through system bus 7. 

Instruction cache 3 and data cache 4 are connected to system bus 7 
through a bridge 8. In addition, main memory 5 and various types of 
5 functional blocks such as USART 6 are connected to system bus 7. In Fig. 
1, USART 6 is illustrated to represent the various functional blocks. In 
Fig. 1, all functional blocks are realized within a single LSI (Large Scale 
Integrated Circuit) chip. Alternatively, part of the function of the block 
may be realized by separate LSI chip or semiconductor memory device 1 

10 may be directly connected to bridge 8. 

Fig. 2 is a block diagram showing an internal configuration of 
semiconductor memory device 1 according to the first embodiment of the 
present invention. Semiconductor memory device 1 includes an address 
table 11, a determination unit 12, an instruction buffer 13, a selector 14 

15 which selectively outputs an instruction code outputted from instruction 
buffer 13 and an instruction code outputted from instruction cache 3, a 
switch 1 (hereinafter, abbreviated as "SWl'O 15, a switch 2 (hereinafter, 
abbreviated as "SW2") 16, and an inverter 17. 

Instruction bxiffer 13 is a randomly accessible memory device, and 

20 stores part of consecutive instructions performed by CPU core 2. 

Address table 11 includes a valid bit storage region 111, a maximum 
address storage region 112 and a minimum address storage region 113. 
Maximum address region 112 stores the maximum value of addresses 
corresponding to consecutive instructions stored in instruction bviffer 13. 

25 Minimum address region 113 stores the minimum value of addresses 

corresponding to the consecutive instructions stored in instruction bxiffer 13. 
Valid bit region 111 stores a bit indicating whether address information 
stored in address table 11 is valid or invalid. 

Determination unit 12 determines whether to select the instruction 

30 code outputted from instruction buffer 13 or the instruction code outputted 
from instruction cache 3 while referring to addresses stored in address 
table 11 and the address outputted from CPU core 2. Determination unit 
12 includes subtracters 121 and 122, a difference register 1 (123) which 
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stores the subtraction resiilt of subtracter 121, a difference register 2 (124) 
which stores the subtraction resxdt of subtracter 122, an inverter 125, and a 
three-input AND circxiit 126. 

Subtracter 121 subtracts the maximum address stored in region 112 
5 from an address outputted from CPU core 2, and stores the subtraction 
result in difference register 1 (123). Subtracter 122 subtracts the 
minimum address stored in region 113 from the address outputted from 
CPU core 2, and stores the subtraction resiilt in difference register 2 (124). 
AND circxiit 126 performs an AND operation among the value of 

10 valid bit region 111, the sign bit (MSB: Most Significant Bit) of difference 
register 1 (123), and the inverted value of the sign bit of difference register 
2 (124), thereby determining whether or not the addresses outputted from 
CPU core 2 fall within a predetermined range. Namely, in the case where 
"minimum address 113 < instruction address", "instruction address < 

15 maximum address 112" and "valid bit = 1", then AND circmt 126 

determines instruction address hit and outputs high level (hereinafter, 
abbreviated as "H level"). 

In the case of determining that the instruction address hits, AND 
circmt 126 outputs H level, whereby selector 14 is made select the 

20 instruction code outputted from instruction buffer 13, and SW2 (16) is 

turned on to output the value (the address of instruction buffer 13) from 
difference register 2 (124) to instruction buffer 13. At the same time, since 
address output to instruction cache 3 is unnecessary, SWl (15) is turned off. 
In the case of determining the instruction address fails to hit, AND 

25 circuit 126 outputs low level (hereinafter, abbreviated as "L level"), 

whereby selector 14 is made select the instruction code outputted from 
instruction cache 3. At the same time, since address output to instruction 
buffer 13 is unnecessary, SW2(16) is turned off and SWl (15) is turned on 
to output the instruction address to instruction cache 3. 

30 Fig. 3 is a memory map in the case where CPU core 2 accesses 

semiconductor memory device 1 as a memory mapped device. A control 
register "STASH_STS" is a register which indicates the operation state 
(active/inactive) of semiconductor memory device 1. A control register 
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''STASH_CNT' is a register which sets the operation state of semiconductor 
memory device 1. 

In the memory map shown in Fig. 3, address spaces shown as 
"address table" and "instruction buffer" are maps of instruction bviffer 13 
5 and address table 11 shown in Fig. 2. In this embodiment, each address 
space is a 4-K byte region. 

In these address spaces, data or instructions are readable/writable 
only when semiconductor memory device 1 is inactive. By setting a value 
in control register "STASH_CNT", these address spaces can be turned into 

10 inactive states or active states. In the case where semiconductor memory 
device 1 is inactive, then, determination unit 12 does not operate, 
semiconductor memory device 1 is bypassed, and CPU core 2 is directly 
connected to instruction cache 3. 

Maximum value 112 is set at Ox4000_1000 address and minimum 

15 value 113 is set at Ox4000_1004 address in address table 11. Right after 
semiconductor memory device 1 is reset, address table 1 1 is initialized to 
"0". When a value other than "0" is set to address table 11, valid bit region 
111 of address table 11 is automatically set. 

Fig. 4 is a block diagram showing the configuration of a software 

20 development apparatus for a processor system which uses semiconductor 
memory device 1 according to the first embodiment of the present. This 
software development apparatus includes a computer main body 21, a 
display unit 22, an FD (Flexible Disk) drive 23 to which an FD 24 is 
attached, a keyboard 25, a mouse 26, a CD-ROM (Compact Disk-Read Only 

25 Memory) device 27 to which a CD-ROM 28 is attached, and a network 
communication device 29. 

A program for realizing the software development apparatus 
(hereinafter, which program will be referred to as "software development 
program") is suppUed by a recording medium such as FD 24 or CD-ROM 28. 

30 By allowing computer main body 2 1 to perform the software development 
program, software development is carried out. Alternatively, the software 
development program may be suppHed to computer main body 21 fi:om 
another computer via network communication device 29. 
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Computer main body 21 includes a CPU 30, a ROM (Read Only 
Memory) 31, a RAM (Random Access Memory) 32, and a hard disk 33. 
CPU 30 performs processings while inputting and outputting data to and 
from display unit 22, FD drive 23, keyboard 25, mouse 26, CD-ROM device 
5 27, network communication device 29, ROM 31, RAM 32 or hard disk 33. 
The software development program recorded on either FD 24 or CD-ROM 
28 is stored in hard disk 33 by CPU 30 through FD drive 23 or CD-ROM 
device 27. CPU 30 appropriately loads the software development program 
from hard disk 33 to RAM 32 and performs the program, thereby carrying 

10 out the software development. 

Fig. 5 is a block diagram showing a functional configuration of the 
software development apparatus according to the first embodiment of the 
present invention. This software development apparatus includes: a 
source file storage section 101 which stores a source file (assembler source); 

15 an instruction extraction section 102 which extracts consecutive 

instructions (program) arranged in semiconductor memory device 1; an 
address information acquisition section 103 which acquires address 
information on which the consecutive instructions extracted by instruction 
extraction section 102 is originally mapped; an address information 

20 reformatting section 104 which reformats the address information acquired 
by address information acqxiisition section 103 into a form compatible with 
an assembler source; an assembler 105; and a linker 106, 

Fig. 6 is a flow chart for describing the processing procedures of the 
software development apparatus according to the first embodiment of the 

25 present invention. In Fig. 6, it is assumed to carry put program 

development using an assembly language. First, a programmer designates 
the location of a program to be arranged in semiconductor memory device 1 
(Sll). 

Fig. 7A shows part of the program performed by CPU core 2. The 
30 location of the program arranged in semiconductor memory device 1 is 

indicated by a portion surrounded by "_stashl_top" and "_stashl_end" in a 
fiinction "_funcl". Herein, "„stashl_top" and "_stashl_end" indicate 
reserved S3anbols. 
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Next, instruction extraction section 102 extracts, as a script program, 
the program arranged in semiconductor memory device 1 &om the 
assembler source stored in source file storage section 101 (S12). 

Fig. 7B shows one example of the program extracted by instruction 
5 extraction section 102. Part of the program between "_stashl_top" and 
"_stashl_end'' is extracted. 

Next, assembler 105 assembles the original program stored in source 
file storage section 101. Linker 106 links the original program assembled 
by assembler 105 (S13). Since the assembly and link are similar to those 
10 for ordinary software development using an assembly language, they will 
not be described herein in detail. It is noted that linker 106 links the 
program so that the program can be linked again for processings to be 
described later. 

Address information acquisition section 103 acquires the address 

15 information on which the consecutive instructions arranged in 

semiconductor memory device 1 are originally mapped, from the original 
program linked by linker 106 (S14). Specifically, address information 
acquisition section 103 acquires the address information on reserved 
symbols "_stashl_top" and "_stashl_end" by disassembling the program 

20 hnked by linker 106 or extracting map information. 

Assembler 105 assembles the consecutive instructions extracted by 
instruction extraction section 102 (S15). This assembly is similar to the 
assembly of a program described in an ordinary assemble language, it will 
not be described herein in detail. 

25 Address information reformatting section 104 reformats the address 

information acquired by address information acquisition section 103 into a 
form compatible with the assembler source (816). Fig. 7C shows the 
address information reformatted by address information reformatting 
section 104. As shown Fig. 7C, the top address and end address of the 

30 program arranged in semiconductor memory device 1 are described on the 
second line. 

Next, assembler 105 assembles the address information reformatted 
by address information reformatting section 104 (Sl7). Since this 
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assembly is similar to the assembly of the program described in the 
ordinary assemble language, it will not be described herein in detail. 

Finally, Hnker 106 links object files generated in steps Sl3, S15 and 
S17 with one another, and generates a load modiile (Sl8). Fig. 7D shows 
5 part of the disassemble result of the modxde linked by linker 106. As 

shown in Fig. 7D, the consecutive instructions arranged in semiconductor 
memory device 1 and the address information corresponding to the 
consecutive instructions are stored in regions starting at address 
0x4000200 and at address 0x40001000, respectively. 

10 Figs. 8A to 8C show the operation of the software development 

apparatus according to the first embodiment of the present invention as the 
memory images of the program to be performed. Fig. 8A shows the 
memory image corresponding to a load module generated by using a 
standard development tool. 

15 Fig. 8B shows the memory image corresponding to the load module 

generated by the software development apparatus in this embodiment. 
The consecutive instructions extracted by instruction extraction section 102 
(extracted consecutive instructions) are linked with an initiahzation section. 
The extracted consecutive instructions are set into semiconductor memory 

20 device 1 as shown in Fig. 8C by allowing the program included in the 
initialization section to be performed by CPU core 2. While the 
initialization section is linked with the load module, a function 
corresponding to the load modxde can be also realized by the loader of an 
OS (Operating System). 

25 As described above, according to the semiconductor memory device of 

this embodiment, part of the consecutive instmctions performed by CPU 
core 2 are stored in instruction buffer 13, and CPU core 2 fetches the 
consecutive instructions stored in instruction buffer 13. By storing the 
consecutive instructions which are frequently executed in instruction buffer 

30 13, it is possible to decrease the frequency of replacement of the instruction 
cache to thereby decrease the number of execution cycles, and to improve 
CPU processing performance. In addition, since it is possible to decrease 
the frequency of access to the instruction cache, it is possible to improve 
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power efficiency. 

Furthermore, since semiconductor memory device 1 also acts as a 
memory mapped device, even a general CPU core can perform read/write of 
resources, operation mode setting, and the like. 
5 Moreover, the operation mode is set so that when the possibility of 

access to semiconductor memory device 1 is low, semiconductor memory 
device 1 is bypassed and CPU core 2 is directly connected to instruction 
cache 3. With this configuration, it is possible to suppress the power 
consumption of semiconductor memory device 1. As a result, it is possible 

10 to decrease an increase in power consumption which may occur when 

semiconductor memory device 1 is introduced. In addition, the rewrite 
(including initialization) of the content of instruction buffer 3 can be carried 
out only when semiconductor memory device 1 is inactive. Therefore, 
exclusive control accompansdng the resource rewrite can be realized 

15 without adding a new mechanism. 

Moreover, in suppl3rLng the instruction address to instruction buffer 
13 or instruction cache 3, the supply of the instruction address to the 
unnecessary buffer or cache is stopped. It is thereby possible to suppress 
the unnecessary state transition of the memory by the change of the 

20 address bus and to decrease power consumption. 

Further, according to the software development apparatus in this 
embodiment, the software corresponding to semiconductor memory device 1 
in this embodiment can be developed only by adding the function of 
extracting the consecutive instructions arranged in semiconductor memory 

25 device 1 to an existing software development tool. In other words, it is 

possible to realize the software development apparatus in this embodiment 
only by the simple extension of the existing software development tool. 

The processor system which employs semiconductor memory device 1 
in this embodiment is guaranteed of the access cycle for semiconductor 

30 memory device 1. Therefore, by allowing a software programmer to use 
the software development apparatus in this embodiment, it is possible to 
develop software having high real-time characteristics. 

Additionally, the software development apparatus in this 
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embodiment can be realized only by utilizing a standard tool chain 
including the assembler and the linker and adding several filter programs. 
It is, therefore, possible to easily create a software development program. 
Accordingly, it is possible to easily utilize the functions of semiconductor 
5 memory device 1 in this embodiment. 
Second Embodiment 

A configuration of a software development apparatus according to a 
second embodiment of the present invention is similar to that of the 
software development apparatus according to the first embodiment shown 

10 in Fig. 4. Therefore, repetitive constituent elements and functions of the 
apparatus will not be repeatedly described herein in detail. 

Fig. 9 is a block diagram showing a fimctional configuration of the 
software development apparatus according to the second embodiment of the 
present invention. The software development apparatus shown in Fig. 9 

15 differs from that shown in Fig. 5 only in that a compiler 107 and an object 
file reformatting section 108 which converts and reformats an object file 
generated by compiler 107 into a form of the assembler source. The 
repetitive constituent elements and functions of the apparatus will not be, 
therefore, described herein in detail. 

20 Fig. 10 is a flow chart for describing the processing procedures of the 

software development apparatus according to the second embodiment of the 
present invention. Among the processing procedures of the software 
development apparatus in this embodiment, step Sll in the flow chart 
shown in Fig. 6 is replaced by steps S21 to S23 shown in Fig. 10. 

25 Therefore, the repetitive processing procedures will not be described herein 
in detafl. In this embodiment, it is assumed to carry out program 
development using a high-level language such as C language. 

First, a programmer designates, firom an original source file in the 
high-level language, the location of a program to be arranged in 

30 semiconductor memory device 1 (S21). 

Fig. 11 shows part of the program arranged in semiconductor 
memory device 1 in the software development using the C language. The 
location of the program to be arranged in semiconductor memory device 1 is 
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indicated by a portion surrounded by "#pragma_stashl_top" and 
**#pragnia_stashl_end'\ Herein, "#pragma_stashl_top'' and 
*'#pragma_stashl_end" indicate reseirved control character strings. 

Next, compiler 107 compiles the high-level language program and 
5 generates an object file (S22). Since this compiling is similar to that in 
ordinary software development using a high-level language, it will not be 
described herein in detail. 

Object file reformatting section 108 disassembles the object file 
generated by compiler 107 together with source line information. The 

10 disassembly result, which includes information corresponding to the 
assembly language-level program generated by compiler 107, can be 
converted into the form of the assembler source shown in Fig. 7A (S23). 
The assembler source thus reformatted and generated is subjected to the 
processings in step S12 and the following steps shown in Fig. 6. 

15 As described above, according to the software development apparatus 

in this embodiment, the high-level language program is compiled to 
generate the object file and, then, the object file is disassembled and 
reformatted. Therefore, in addition to the advantages described in the 
first embodiment of the present invention, it is possible to carry out 

20 software development corresponding to the processor system which uses 
semiconductor memory device 1 by even the program using a high-level 
language. 
Third Embodiment 

A semiconductor memory device according to a third embodiment of 

25 the present invention can arrange two sets of programs therein. Since a 
semiconductor memory device which arranges therein three or iriore sets of 
programs can be easily estimated from the semiconductor memory device in 
this embodiment, it will not be described herein in detail. 

Fig. 12 is a block diagram showing a schematic configuration of the 

30 semiconductor memory device according to the third embodiment of the 

present invention. Semiconductor memory device la includes an address 
table 1 (41), an address table 2 (42), determination units 43 and 44, an 
instruction bxiffer 1 (45), an instruction buffer 2 (46), and a selector 47. 
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Address table 1 (41) and address table 2 (42) are equal in 
configuration and function to address table 11 shown in Fig. 2. 
Determination units 43 and 44 are equal in configuration and fiinction to 
determination unit 12 shown in Fig. 2. In addition, instruction bviffer 1 
5 (45) and instruction buffer 2 (46) are equal in configuration and fiinction to 
instruction bxiffer 13 shown in Fig. 2. 

In the case where an instruction address outputted from CPU core 2 
hits on an address stored in address table 1 (41), selector 47 selects an 
instruction code outputted firom instruction buffer 1 (45) and outputs the 

10 selected instruction code to CPU core 2. In the case where the instruction 
address outputted from CPU core 2 hits on an address stored in address 
table 2 (42), selector 47 selects an instruction code outputted from 
instruction buffer 2 (46) and outputs the selected instruction code to CPU 
core 2. Further, in the case where the instruction address outputted from 

15 CPU core 2 does not hit on either an address stored in address table 1 (41) 
or an address stored in address table 2 (42), selector 47 selects an 
instruction code outputted from instruction cache 3 and outputs the 
selected instruction code to CPU core 2. 

Fig. 13 is a memory map in the case where CPU core 2 accesses 

20 semiconductor memory device la as a memory mapped device. The 

memory map shown in Fig. 13 differs firom that shown in Fig. 3 only in that 
two "address tables" and two "instruction buffers** are mapped and each 
table or buffer is a 4-K byte region. 

Fig. 14 is a flow chart for describing the processing procedures of the 

25 software development apparatus according to the third embodiment of the 
present invention. First, a disassembler disassembles an object file (a.o) 
51 of a software performed by a processor system including semiconductor 
memory device la in this embodiment (S31). 

Next, common consecutive instructions which appear in a plurality of 

30 locations are detected from the source file generated by the disassembler 
(S32). The detected common consecutive instructions and address 
information related to the common consecutive instructions are assembled, 
and object files (buf.o, at.o) 53 and 55 are generated, respectively. 
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Finally, generated object files (bxif.o, at.o) 53 and 55 are linked with 
original object file (a.o) 51 and a routine (stash. o) 54 for initializing 
semiconductor memory device 1, thereby generating a load module (a.abs) 
56. It is noted that the software development tool of target CPU core 2 can 
5 be utilized as the disassembler and the linker as it is. 

In addition, to detect the common consecutive instructions, hardware 
confi.guration information such as size information 52 on target 
semiconductor memory device 1 is used as parameters, and the common 
consecutive instructions with such a size as to be able to be stored in 
10 semiconductor memory device 1 are extracted firom instruction patterns 

which repeatedly appear. The extracted common consecutive instructions 
are arranged in instruction buffer 1 (45) and instruction buffer 2 (46), 
respectively. 

As described above, according to semiconductor memory device 1 in 

15 this embodiment, a plurality of common consecutive instructions can be 
arranged in semiconductor memory device 1. Therefore, it is possible to 
attain the same advantages as those described in the first embodiment in 
relation to the common program part which performs the same operation, 
and to improve the versatiHty of semiconductor memory device 1. 

20 Fourth Embodiment 

A semiconductor memory device according to a fourth embodiment of 
the present invention differs firom the semiconductor memory device in the 
first embodiment shown in Fig. 2 only by the configuration and function of 
the determination unit. Therefore, the repetitive configurations and 

25 functions of the semiconductor memory device in this embodiment will not 
be described herein in detail. The determination unit in this embodiment 
will be described while being denoted by reference ssnnbol 12\ 

Fig, 15 is a block diagram showing a configuration of determination 
unit 12' according to the fourth embodiment of the present invention. 

30 Deternnnation unit 12' includes a comparator 61, subtracters 62 and 63, a 
difference register 1 (64) which stores the subtraction resxilt of subtracter 
62, a difference register 2 (65) which stores the subtraction result of 
subtracter 63, an inverter 66, and a two-input AND circuit 67. 
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In the case where a value is set in vahd bit region 111, comparator 
61 compares upper 19 bits of an address outputted from CPU core 2 with 
upper 19 bits of a maximum value of the address stored in region 112. In 
the case where the both upper 19 bits of the addresses coincide with one 
5 another, comparator 61 enables subtracters 62 and 63. In the case where 
they do not coincide one another, comparator 61 disables subtracters 62 
and 63. 

Subtracter 62 subtracts lower 13 bits of the maximum value of the 
address stored in region 112 from lower 13 bits of the address outputted 
10 from CPU core 2, and stores the subtraction result in difference register 1 

(64) . Subtracter 63 subtracts lower 13 bits of a minimum value of the 
address stored in region 113 from lower 13 bits of the address outputted 
from CPU core 2, and stores the subtraction result in difference register 2 

(65) . 

15 AND circuit 67 performs an AND operation between the sign bit of 

difference register 1 (64) and the inverted value of the sign bit of difference 
register 2 (65), thereby determining whether or not the lower 13 bits of the 
address outputted from CPU core 2 fall within a predetermined range. In 
the case of determining that the lower 13 bits of the instruction address hit, 

20 AND circxiit 67 outputs H level. 

As described above, according to semiconductor memory device 1 in 
this embodiment, comparator 61 compares the upper 19 bits of the address 
outputted from CPU core 2 with the upper 19 bits of the maximum address 
stored in region 112, and controls subtracters 62 and 63 in accordance with 

25 the comparison result. It is therefore possible to decrease the number of 
bits of subtracters 62 and 63, difference register 1 (64) and difference 
register 2 (65), to thereby decrease the circuit scale of determination unit 
12' and to decrease the power consumption of semiconductor memory device 
1. 

30 Fifth Embodiment 

A semiconductor memory device according to a fifth embodiment of 
the present invention differs from the semiconductor memory device 
according to the first embodiment shown in Fig. 2 only by the configuration 
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and function of the determination unit. Therefore, the repetitive 
configurations and functions of the semiconductor memory device in this 
embodiment will not be described herein in detail. The determination unit 
in this embodiment will be described while being denoted by reference 
5 symbol 12". 

Fig. 16 is a block diagram showing a configuration of determination 
unit 12" according to the fiiffch embodiment of the present invention. 
Determination unit 12" includes subtracters 71 and 72, a difference 
register 1 (73) which stores the subtraction result of subtracter 71, a 

10 difference register 2 (74) which stores the subtraction result of subtracter 
72, an inverter 75, and a three-input AND circuit 76. 

In this embodiment, a program arranged in instruction buffer 13 is 
limited into a basic block. That is, the program is limited so that 
consecutive instructions arranged in instruction buffer 13 do not include a 

15 branch instruction and branching to the consecutive instructions arranged 
in instruction buffer 13 is not performed. 

By giving such a limitation to a software programmer, an instruction 
address is always incremented when the CPU fetches an instruction code 
from instruction bxiffer 13. In other words, once there is an instruction 

20 address hit, the value of MSB of difference register 2 (74) has no change 
until the next miss-hit generates. Thus, the operation of subtracter 72 
which subtracts minimum value 113 firom instruction address outputted 
firom CPU core 2 is stopped until the next miss-hit generates. 

AND circxiit 76 performs an AND operation between the sign bit of 

25 difference register 1 (73) and the inverted value of the sign bit of difference 
register 2 (74), thereby determining whether or not the address outputted 
firom CPU core 2 falls within a predetermined range. In the case of 
determining instruction address hit, AND circmt 76 outputs H level, and 
disables subtracter 72 to stop the operation of subtracter 72. In the case of 

30 determining instruction address miss-hit, AND circuit 76 outputs L level, 
and enables subtracter 72 to start the operation thereof. 

In addition, the lower 12 bits of the instruction address outputted 
firom CPU core 2 are supplied to instruction bxiffer 13. 
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Fig. 17 is a flow chart for describing the processing procedures of the 
software development apparatus according to the fifth embodiment of the 
present invention. Among the processing procedures of the software 
development apparatus in this embodiment, step S15 in the flow chart 
5 shown in Fig. 6 is replaced by steps S31 to S33 shown in Fig. 17. 

Therefore, the repetitive processing procedures will not be described herein 
in detail. 

Address information acquisition section 103 acquires the lower 12 
bits of the address of a leading instruction firom the address information 

10 acquired in step S14 (S31). The consecutive instructions program) 

extracted by instruction extraction section 102 is shifted in instruction 
buffer 13 by as much as the number of bytes corresponding to the lower 12 
bits of the address of the leading instruction. In this way, the lower 12 
bits of the instruction addresses outputted from CPU core 2 are matched to 

15 those of the address in instruction buffer 13. 

Next, assembler 105 assembles the consecutive instructions 
extracted by instruction extraction section 102 (S33), and moves to a 
processing in step Sl6 shown in Fig. 6. 

As described above, by giving the limitation to the software 

20 programmer at the time of program creation, it is possible to facilitate 
determination by determination unit 12" and to stop the operation of 
subtracter 72 while there is instruction address hit. It is, therefore, 
possible to decrease the power consumption of semiconductor memory 
device 1. 

25 Sixth Embodiment 

A brea]q)oint function which is widely utilized in software debugging 
is normally realized by replacing an instruction corresponding to a 
breakpoint by an exception instruction. However, in semiconductor 
memory device 1 according to the first embodiment, it is impossible to 

30 rewrite instruction buffer 13 or the like while the processor system operates 
and to set an exception instruction into instruction bxiffer 13. Thus, 
semiconductor memory device 1 has a disadvantage of making program 
debugging dif&cult. In this embodiment, this disadvantage is solved and 
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program debugging can be efficiently performed. 

A configuration of a processor system which employs the 
semiconductor memory device according to the sixth embodiment of the 
present invention differs fi*om a processor system which employs the 
5 semiconductor memory device according to the fifth embodiment only in 
that a semiconductor memory device reserved region is mapped on a 
memory map. Therefore, the repetitive constituent elements and functions 
of the semiconductor in this embodiment will not be described herein in 
detail. 

10 Fig. 18 shows one example of the memory map according to the sixth 

embodiment of the present invention. A semiconductor memory device 
reserved region equal in size to instruction buffer 13 of semiconductor 
memory device 1 is set in main memory 5. 

The processing procedures of the software development apparatus 

15 according to the sixth embodiment of the present invention are the same as 
those in the fifth embodiment except for the processing of step S18. 
Therefore, repetitive processing procedures will not be described herein in 
detail. In this embodiment, the processing procedures will be described 
while step SIS is replaced by a step SIS'. 

20 In step SIS', Hnker 106 sets a branch instruction to a top address in 

the region of main memory 5 corresponding to the consecutive instructions 
arranged in instruction bufier 13. The destination of the branch 
instruction is set to a top address in the semiconductor memory device 
reserved region shown in Fig. 18. In addition, a branch instruction is 

25 added to the end of the consecutive instructions extracted by instruction 
extraction section 102 (extracted consecutive instructions) and set in the 
semiconductor memory device reserved region. The destination of the 
branch instruction is set to a location right after an end address in the 
region of main memory 5 corresponding to the extracted consecutive 

30 instructions. 

These processing procedures wiU be described more specifically on 
the basis of on the program shown in Figs. 7A to 7D. First, by the first 
half processing of step SIS', the branch instruction to OxffOOO is set at 
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address 0x264. By the second half processing thereof, the branch 
instruction to 0x26c is added to a location right after the extracted 
consecutive instructions shown in Fig. 7B, and the resultant consecutive 
instructions are set at address OxHOOO. 

In this way, when the debugging initialization section is linked, the 
extracted instruction main body other than the branch instruction is 
performed by fetching the instructions from the semiconductor memory 
device reserved region of main memory 5. Since this region is located on 
main memory 5, it is possible to set a breakpoint. In other words, if only 
the initialization section is utilized as a different program while leaving the 
program main body and the extracted consecutive instructions unchanged, 
it is possible to debug the software by using the algorithms of the program 
main body and the extracted consecutive instructions as they are. 

As described above, according to the processor system in this 
embodiment, the semiconductor memory device reserved region is provided 
on main memory 5 and the extracted consecutive instructions are arranged 
in the semiconductor memory device reserved region. It is therefore 
possible to set a breakpoint, and to facilitate software debugging. 

Although the present invention has been described and illustrated in 
detail, it is clearly understood that the same is by way of illustration and 
example only and is not to be taken by way of limitation, the spirit and 
scope of the present invention being limited only by the terms of the 
appended claims. 
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